1 The Definition of a Stochastic Process 

Suppose that $(\Omega, \mathcal{F}, P)$ is a probability space, and that $X : \Omega \to \mathbb{R}$ is a random variable.
Recall that this means that $\Omega$ is a space, $\mathcal{F}$ is a $\sigma$-algebra of subsets of $\Omega$, $P$ is a countably
additive, non-negative measure on $(\Omega, \mathcal{F})$ with total mass $P(\Omega) = 1$, and $X$ is a measurable
function, i.e., $X^{-1}(B) = \{\omega \in \Omega : X(\omega) \in B\} \in \mathcal{F}$ for every Borel set $B \in \mathcal{B}(\mathbb{R})$.

A stochastic process is simply a collection of random variables indexed by time. It will
be useful to consider separately the cases of discrete time and continuous time. That is,
a discrete time stochastic process $X = \{X_n, n = 0, 1, 2, \ldots\}$ is a countable collection of
random variables indexed by the non-negative integers, and a continuous time stochastic
process $X = \{X_t, 0 \le t < \infty\}$ is an uncountable collection of random variables indexed by
the non-negative real numbers. We will even have occasion to consider indexing the random
variables by negative time.

In general, we may consider any indexing set $I \subseteq \mathbb{R}$ having infinite cardinality, so that
calling $X = \{X_\alpha, \alpha \in I\}$ a stochastic process simply means that $X_\alpha$ is a random variable for
each $\alpha \in I$. (If the cardinality of $I$ is finite, then $X$ is not considered a stochastic process,
but rather a random vector.)

There are two natural questions that one might ask. 

(1) How can we construct a probability space on which a stochastic process is defined? 

(2) Is it possible to define a stochastic process by specifying, say, its finite dimensional 
distributions only? 

Instead of immediately addressing these (rather technical) questions, we assume the 
existence of an appropriate probability space, and carefully define a stochastic process on 
that space. In fact, we will defer answering these questions for some time. 

Definition 1.1. Suppose that $(\Omega, \mathcal{F}, P)$ is a probability space, and that $I \subseteq \mathbb{R}$ is of infinite
cardinality. Suppose further that for each $\alpha \in I$, there is a random variable $X_\alpha : \Omega \to \mathbb{R}$
defined on $(\Omega, \mathcal{F}, P)$. The function $X : I \times \Omega \to \mathbb{R}$ defined by $X(\alpha, \omega) = X_\alpha(\omega)$ is called a
stochastic process with indexing set $I$, and is written $X = \{X_\alpha, \alpha \in I\}$.

Remark. We will always assume that the cardinality of $I$ is infinite, either countable or
uncountable. If $I = \mathbb{Z}_+$, then we call $X$ a discrete time stochastic process, and if $I = [0, \infty)$,
then $X$ is said to be a continuous time stochastic process.

At first, this definition might seem a little complicated since we are regarding the stochastic
process $X$ as a function of two variables defined on the product space $I \times \Omega$. However,
this is necessary since we do not always want to view the stochastic process $X$ as a collection
of random variables. Sometimes, it is more advantageous to consider $X$ as the (random)
function $\alpha \mapsto X(\alpha, \omega)$ which is called the sample path (or trajectory) of $X$ at $\omega$ (and is
also written $X(\omega)$). We will need to require $X$ as a function of $\alpha$ to have certain regularity
properties such as continuity or measurability; as will be shown in Example 3.10 below, these
properties do not come for free!

Notation. A word should be said about notation. We have defined a stochastic process
as a single function $X$ of two variables. That is, $X : I \times \Omega \to \mathbb{R}$ is defined by specifying
$(\alpha, \omega) \mapsto X(\alpha, \omega)$ which mimics the notation from multi-variable calculus. However, we are
also viewing a stochastic process as a collection of random variables, one random variable
for each $\alpha$ in the indexing set $I$. That is, if the random variable $X_\alpha : \Omega \to \mathbb{R}$ is defined by
specifying $\omega \mapsto X_\alpha(\omega)$, then the stochastic process $X$ is defined as $X(\alpha, \omega) = X_\alpha(\omega)$. In
fact, we will often say for brevity that $X = \{X_\alpha, \alpha \in I\}$ is a stochastic process on $(\Omega, \mathcal{F}, P)$.
Because of this identification, when there is no chance of ambiguity we will use both $X(\alpha, \omega)$
and $X_\alpha(\omega)$ to describe the stochastic process. If the dependence on $\omega$ is unnecessary, we
will simply write $X_\alpha$ or even $X(\alpha)$. The sample path of $X$ at $\omega$ will be written as either
$\alpha \mapsto X_\alpha(\omega)$ or just $X(\omega)$.

Example 1.2. Perhaps the simplest example of a stochastic process is what may be termed
i.i.d. noise. Suppose that $X_1, X_2, \ldots$ are independent and identically distributed random
variables on a probability space $(\Omega, \mathcal{F}, P)$ each having mean zero and variance one. The
stochastic process $X : \mathbb{N} \times \Omega \to \mathbb{R}$ defined by $X(n, \omega) = X_n(\omega)$ is called i.i.d. noise and
serves as the building block for other more complicated stochastic processes. For example,
define the stochastic process $S : \mathbb{N} \times \Omega \to \mathbb{R}$ by setting

$$S(n, \omega) = S_n(\omega) = \sum_{i=1}^{n} X_i(\omega).$$

The stochastic process $S$ is called a random walk and will be studied in greater detail later.
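
As a rough illustration of the construction in Example 1.2, the following Python sketch builds
a random walk from i.i.d. noise. The choice of fair $\pm 1$ coin flips for the noise, the function
names `iid_noise` and `random_walk`, and the use of a seeded generator as a stand-in for fixing
$\omega$ are assumptions made for this illustration only; any distribution with mean zero and
variance one could be substituted.

```python
import random
from itertools import accumulate


def iid_noise(n, rng):
    """Return X_1, ..., X_n: i.i.d. with mean zero and variance one.

    Assumption for this sketch: each X_i is a fair +/-1 coin flip.
    """
    return [rng.choice((-1.0, 1.0)) for _ in range(n)]


def random_walk(noise):
    """Return S_1, ..., S_n where S_k = X_1 + ... + X_k."""
    return list(accumulate(noise))


rng = random.Random(0)  # fixing the seed plays the role of fixing omega
xs = iid_noise(10, rng)
walk = random_walk(xs)
print(xs)    # one realization of the noise
print(walk)  # the corresponding sample path of the random walk
```

Re-running with a different seed corresponds to choosing a different $\omega$, and hence a
different sample path $n \mapsto S_n(\omega)$.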

The following section discusses some examples of continuous time stochastic processes. 


2 Examples of Continuous Time Stochastic Processes 


We begin by recalling the useful fact that a linear transformation of a normal random variable 
is again a normal random variable. 

Proposition 2.1. Suppose that $Z \sim \mathcal{N}(0, 1)$. If $a, b \in \mathbb{R}$ and $Y = aZ + b$, then $Y \sim \mathcal{N}(b, a^2)$.

Proof. By assumption, the characteristic function of $Z$ is $\phi_Z(u) = E(e^{iuZ}) = \exp(-u^2/2)$.
Thus, the characteristic function of $Y$ is

$$\phi_Y(u) = E(e^{iuY}) = e^{iub} E(e^{iuaZ}) = e^{iub} \phi_Z(au) = e^{iub} e^{-(au)^2/2} = \exp\left( iub - \frac{a^2 u^2}{2} \right),$$

which is the characteristic function of a $\mathcal{N}(b, a^2)$ random variable. □

Note that if $a = 0$, then $Y$ reduces to the constant random variable $Y = b$. Also note
that if $Z \sim \mathcal{N}(0, 1)$, then $-Z \sim \mathcal{N}(0, 1)$. For this reason, many authors restrict to $a > 0$.
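
Proposition 2.1 can be sanity-checked numerically. The sketch below is only a Monte Carlo
illustration, not a proof: it draws samples of $Y = aZ + b$ and compares the sample mean and
variance with $b$ and $a^2$. The values $a = -2$, $b = 3$, the sample size, and the helper name
`sample_linear_transform` are arbitrary choices made for this example.

```python
import random


def sample_linear_transform(a, b, n, rng):
    """Draw n samples of Y = a*Z + b with Z ~ N(0, 1)."""
    return [a * rng.gauss(0.0, 1.0) + b for _ in range(n)]


rng = random.Random(1)
a, b = -2.0, 3.0  # a negative a is allowed: the variance only depends on a^2
ys = sample_linear_transform(a, b, 50_000, rng)

sample_mean = sum(ys) / len(ys)
sample_var = sum((y - sample_mean) ** 2 for y in ys) / len(ys)
print(sample_mean, sample_var)  # should be close to b = 3 and a^2 = 4
```

Matching the first two moments does not by itself show normality; that part is exactly what
the characteristic function argument in the proof supplies.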

Exercise 2.2. Show that if $Z_1 \sim \mathcal{N}(\mu_1, \sigma_1^2)$ and $Z_2 \sim \mathcal{N}(\mu_2, \sigma_2^2)$ are independent, then
$Z_1 + Z_2 \sim \mathcal{N}(\mu_1 + \mu_2, \sigma_1^2 + \sigma_2^2)$.

Both of the following relatively straightforward examples of continuous time stochastic
processes illustrate the two points-of-view that we are taking of a stochastic process,
namely (i) a collection of random variables, and (ii) a random function of the index (called
a trajectory).

Example 2.3. Consider the probability space $(\Omega, \mathcal{F}, P)$, and let $Z$ be a random variable
with $P(Z = 1) = P(Z = -1) = 1/2$. Define the continuous time stochastic process
$X = \{X_t, t \ge 0\}$ by setting

$$X_t(\omega) = Z(\omega) \sin t \quad \text{for all } t \ge 0.$$

The two sections of this stochastic process can be described as follows.

• For fixed $t$, the section $X(t, \cdot) : \Omega \to \mathbb{R}$ is the random variable $X_t$ which has distribution
given by

$$P(X_t = \sin t) = P(X_t = -\sin t) = 1/2.$$

For instance, when $t = 3\pi/4$, the random variable $X_{3\pi/4}$ has distribution

$$P\left(X_{3\pi/4} = \tfrac{1}{\sqrt{2}}\right) = P\left(X_{3\pi/4} = -\tfrac{1}{\sqrt{2}}\right) = 1/2.$$

• For fixed $\omega$, the section $X(\cdot, \omega) : [0, \infty) \to \mathbb{R}$ describes the trajectory, or sample path,
of $X$ at $\omega$. Note that there are two possible trajectories for $X$. Namely, if $\omega$ is such
that $Z(\omega) = 1$, then the trajectory is $t \mapsto \sin t$, and if $\omega$ is such that $Z(\omega) = -1$, then
the trajectory is $t \mapsto -\sin t$. Of course, each trajectory occurs with probability 1/2.

Example 2.4. Suppose that $Z \sim \mathcal{N}(0, 1)$, and define the continuous time stochastic process
$X = \{X_t, t \ge 0\}$ by setting

$$X_t(\omega) = Z(\omega) \sin t \quad \text{for all } t \ge 0.$$

In this example, the two sections of the stochastic process are slightly more complicated.

• If $Z \sim \mathcal{N}(0, 1)$, then it follows from Proposition 2.1 that $(\sin t) \cdot Z \sim \mathcal{N}(0, \sin^2 t)$.
Therefore, for fixed $t$, the section $X(t, \cdot) : \Omega \to \mathbb{R}$ is the random variable $X_t$ which has
the $\mathcal{N}(0, \sin^2 t)$ distribution. For instance, if $t = 3\pi/4$, then $X_{3\pi/4} \sim \mathcal{N}(0, 1/2)$.

• As in the previous example, for fixed $\omega$, the section $X(\cdot, \omega) : [0, \infty) \to \mathbb{R}$ describes the
trajectory of $X$ at $\omega$. This time, however, since $Z$ is allowed to assume uncountably
many values, there will be infinitely many possible trajectories. Each trajectory $t \mapsto X_t$
will simply be a standard sine curve with amplitude $Z(\omega)$. That is, once $Z$ is realized
so that the number $Z(\omega)$ is known, the trajectory is $t \mapsto Z(\omega) \sin(t)$. For instance, if
$Z(\omega) = 0.69834$, then the trajectory is $t \mapsto 0.69834 \sin t$. Of course, since $P(Z = z) = 0$
for any $z \in \mathbb{R}$, any given trajectory occurs with probability 0.
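
The two points of view in Example 2.4 can be mimicked in a few lines of Python. This is a
minimal sketch under stated assumptions: the argument `z` stands in for the realized value
$Z(\omega)$, the time grid is an arbitrary choice, and the sample size is chosen only to make
the Monte Carlo estimates reasonably stable.

```python
import math
import random


def X(t, z):
    """X_t(omega) = Z(omega) * sin(t), with z standing in for Z(omega)."""
    return z * math.sin(t)


rng = random.Random(2)

# View (ii): fix omega (that is, fix z = Z(omega)) and vary t to get one trajectory.
z = rng.gauss(0.0, 1.0)
times = [k * math.pi / 4 for k in range(9)]
trajectory = [X(t, z) for t in times]

# View (i): fix t and vary omega to sample the random variable X_t.
t = 3 * math.pi / 4
samples = [X(t, rng.gauss(0.0, 1.0)) for _ in range(50_000)]
mean = sum(samples) / len(samples)
var = sum((x - mean) ** 2 for x in samples) / len(samples)

print(trajectory)
print(mean, var)  # Example 2.4 predicts mean 0 and variance sin^2(3*pi/4) = 1/2
```

Replacing the Gaussian draw by a fair choice between $1$ and $-1$ gives the corresponding
sketch for Example 2.3.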

Notice that in both of the previous two examples, the trajectories of the stochastic
process $X$ were continuous. That is, the trajectories were all of the form $t \mapsto Z \sin t$ for
some constant $Z$, and $Z \sin t$, as a function of $t$, is continuous.

The next example is also of a continuous time stochastic process whose trajectories are
continuous. It is, however, significantly more complicated.

Example 2.5. Consider a collection of random variables $\{B_t, t \ge 0\}$, having the following
properties:

• $B_0 = 0$,

• for $0 \le s < t < \infty$, $B_t - B_s \sim \mathcal{N}(0, t - s)$,

• for $0 \le s < t < \infty$, $B_t - B_s$ is independent of $B_s$,

• the trajectories $t \mapsto B_t$ are continuous.

The stochastic process $B = \{B_t, t \ge 0\}$ is called Brownian motion and is of fundamental
importance in both the theory, and applications, of probability.

It is actually a very deep result that there exists a stochastic process having these properties
(establishing continuous trajectories is the tough part). One way to prove the existence of
Brownian motion is to take an appropriate limit of appropriately scaled simple symmetric random
walks. (This concept of a scaling limit is of fundamental importance in modern probability
research.)
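
To give a feel for the scaling limit mentioned above, here is a hedged Python sketch. It
assumes the usual diffusive scaling, in which a simple symmetric random walk $S_k$ is observed
at times $k/n$ and divided by $\sqrt{n}$; the source does not spell out the scaling, so this
choice, the function name `scaled_walk`, and all numerical parameters are assumptions of the
example. The code only checks that the endpoint at time 1 has mean near 0 and variance near 1,
as $B_1 \sim \mathcal{N}(0, 1)$ would; it says nothing rigorous about convergence of paths.

```python
import math
import random


def scaled_walk(n, t_max, rng):
    """Return W_k = S_k / sqrt(n) at times k/n, for k = 0, ..., int(n * t_max)."""
    steps = int(n * t_max)
    path = [0.0]  # the walk starts at 0, mirroring B_0 = 0
    s = 0
    for _ in range(steps):
        s += 1 if rng.random() < 0.5 else -1  # one simple symmetric step
        path.append(s / math.sqrt(n))
    return path


rng = random.Random(3)
n, t_max, n_paths = 100, 1.0, 4000
endpoints = [scaled_walk(n, t_max, rng)[-1] for _ in range(n_paths)]

mean = sum(endpoints) / n_paths
var = sum((x - mean) ** 2 for x in endpoints) / n_paths
print(mean, var)  # should be near 0 and t_max = 1
```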

Remark. The history of Brownian motion is fascinating. In the summer of 1827, the Scottish
botanist Robert Brown observed that microscopic pollen grains suspended in water
move in an erratic, highly irregular, zigzag pattern. Following Brown’s initial report, other 
scientists verified the strange phenomenon. Brownian motion was apparent whenever very 
small particles were suspended in a fluid medium, for example smoke particles in air. It was 
eventually determined that finer particles move more rapidly, that their motion is stimulated 
by heat, and that the movement is more active when the fluid viscosity is reduced. 

However, it was only in 1905 that Albert Einstein, using a probabilistic model, could 
provide a satisfactory explanation of the Brownian motion. He asserted that the Brownian 
motion originates in the continual bombardment of the pollen grains by the molecules of 
the surrounding water, with successive molecular impacts coming from different directions 
and contributing different impulses to the particles. As a result of the continual collisions, 
the particles themselves had the same average kinetic energy as the molecules. Thus, he 
showed that Brownian motion provided a solution (in a certain sense) to the famous partial 
differential equation $u_t = u_{xx}$, the so-called heat equation.

Note that in 1905, belief in atoms and molecules was far from universal. In fact, Einstein’s 
“proof” of Brownian motion helped provide convincing evidence of atomic existence. Einstein 
had a busy 1905, also publishing seminal papers on the special theory of relativity and the 
photoelectric effect. In fact, his work on the photoelectric effect won him a Nobel prize. 
Curiously, though, history has shown that the photoelectric effect is the least monumental of 
his three 1905 triumphs. The world at that time simply could not accept special relativity! 

Since Brownian motion described the physical trajectories of pollen grains suspended in
water, Brownian paths must be continuous. But they were seen to be so irregular that the
French physicist Jean Perrin believed them to be non-differentiable. (The German mathematician
Karl Weierstrass had recently discovered that such pathological functions do exist.
Indeed the continuous function

$$g(x) = \sum_{n=1}^{\infty} b^n \cos(a^n \pi x),$$

where $a$ is odd, $b \in (0, 1)$, and $ab > 1 + 3\pi/2$, is nowhere differentiable.) Perrin himself
worked to show that colliding particles obey the gas laws, calculated Avogadro’s number,
and won the 1926 Nobel prize.

Finally, in 1923, the mathematician Norbert Wiener established the mathematical existence
of Brownian motion by verifying the existence of a stochastic process with the given
properties.

Exercise 2.6. Deduce from the definition of Brownian motion that for each $t$, the random
variable $B_t$ is normally distributed with mean 0 and variance $t$. Why does this imply that
$E(B_t^2) = t$?

Exercise 2.7. Deduce from the definition of Brownian motion that for $0 \le s < t < \infty$, the
distribution of the random variable $B_t - B_s$ is the same as the distribution of the random
variable $B_{t-s}$.

Exercise 2.8. Show that $\operatorname{Cov}(B_t, B_s) = \min(s, t)$. (Hint: Write $B_s B_t = (B_s B_t - B_s^2) + B_s^2$,
take expectations, and then use the third part of the definition of Brownian motion and
Exercise 2.6.)
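
The identity in Exercise 2.8 can be checked empirically before it is proved. The following
Python sketch samples the pair $(B_s, B_t)$ directly from the increment properties in
Example 2.5 and estimates the covariance. The times $s = 0.5$ and $t = 2$, the sample size,
and the helper name `brownian_pair` are illustrative assumptions; the simulation is a
plausibility check, not a solution to the exercise.

```python
import math
import random


def brownian_pair(s, t, rng):
    """Sample (B_s, B_t) for 0 < s < t using the increment properties of Example 2.5."""
    b_s = rng.gauss(0.0, math.sqrt(s))            # B_s = B_s - B_0 ~ N(0, s)
    b_t = b_s + rng.gauss(0.0, math.sqrt(t - s))  # add an independent N(0, t - s) increment
    return b_s, b_t


rng = random.Random(4)
s, t, n = 0.5, 2.0, 50_000
pairs = [brownian_pair(s, t, rng) for _ in range(n)]

mean_s = sum(p[0] for p in pairs) / n
mean_t = sum(p[1] for p in pairs) / n
cov = sum((p[0] - mean_s) * (p[1] - mean_t) for p in pairs) / n
print(cov)  # should be close to min(s, t) = 0.5
```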

3 Regularity Properties of Continuous Time Stochastic Processes

We now define what it means for a continuous time stochastic process to be continuous. 

Definition 3.1. A continuous time stochastic process $X = \{X_t, t \ge 0\}$ is said to be
stochastically continuous (or continuous in probability) if for every $t \ge 0$, and for every $\epsilon > 0$,

$$\lim_{s \to t} P(|X_t - X_s| > \epsilon) = 0.$$

Definition 3.2. A continuous time stochastic process $X = \{X_t, t \ge 0\}$ is said to be
continuous, or to have continuous sample paths, if $t \mapsto X_t(\omega)$ is continuous for all $\omega$.

Remark. The definitions of a continuous time stochastic process having right-continuous, 
or left-continuous, sample paths are analogous. 

Next we consider what it means for two stochastic processes to be equal. In the presence 
of a probability measure, there are three natural, but related, ways to define sameness. 

Definition 3.3. Suppose that $X = \{X_\alpha, \alpha \in I\}$ and $Y = \{Y_\alpha, \alpha \in I\}$ are two stochastic
processes defined on a common probability space $(\Omega, \mathcal{F}, P)$. We say that $Y$ is a version of
$X$ if for every $\alpha \in I$, we have

$$P(X_\alpha = Y_\alpha) = P(\{\omega \in \Omega : X_\alpha(\omega) = Y_\alpha(\omega)\}) = 1.$$

We say that $X$ and $Y$ are indistinguishable if

$$P(X_\alpha = Y_\alpha \ \forall \alpha \in I) = P(\{\omega \in \Omega : X_\alpha(\omega) = Y_\alpha(\omega) \ \forall \alpha \in I\}) = 1.$$

In other words, $X$ and $Y$ are indistinguishable if

$$\{\omega \in \Omega : X_\alpha(\omega) \ne Y_\alpha(\omega) \text{ for some } \alpha \in I\}$$

is a $P$-null set.


Remark. Some texts use the term modification in place of version. 

If two processes are indistinguishable, then they are trivially versions of each other. However,
the distinction between version and indistinguishable can be subtle since two processes
can be versions of each other, yet have completely different sample paths. This is illustrated
with the following example and exercises.

Example 3.4. Suppose that $Z$ is an absolutely continuous random variable on the probability
space $(\Omega, \mathcal{F}, P)$. (For example, suppose $Z \sim \mathcal{N}(0, 1)$.) Define the stochastic processes
$X$ and $Y$ on the product space $[0, \infty) \times \Omega$ by setting $X_t = 0$ and

$$Y_t = \begin{cases} 0, & \text{if } t \ne |Z|, \\ 1, & \text{if } t = |Z|. \end{cases}$$

We see that $Y$ is a version of $X$ since for every $t \ge 0$,

$$P(X_t \ne Y_t) = P(|Z| = t) = P(Z = t) + P(Z = -t) = 0$$

giving $P(X_t = Y_t) = 1$. On the other hand, $X$ and $Y$ are not indistinguishable since

$$P(X_t = Y_t \ \forall t \ge 0) = 0.$$
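
The gap between "version" and "indistinguishable" in Example 3.4 can be made concrete with a
small Python experiment. It is only a heuristic sketch: floating-point Gaussian draws stand
in for an absolutely continuous $Z$, the fixed time $t = 1$ and the sample size are arbitrary,
and the function name `Y` simply mirrors the notation of the example.

```python
import random


def Y(t, z):
    """Y_t from Example 3.4: equals 1 exactly when t == |Z| and 0 otherwise."""
    return 1 if t == abs(z) else 0


rng = random.Random(5)
zs = [rng.gauss(0.0, 1.0) for _ in range(20_000)]  # realized values Z(omega)

# Fixed time: for a given t, count the omegas at which Y_t differs from X_t = 0.
t_fixed = 1.0
mismatches_at_fixed_t = sum(Y(t_fixed, z) != 0 for z in zs)

# Whole path: for each omega, the path of Y leaves the zero path at t = |Z(omega)|.
paths_that_differ = sum(Y(abs(z), z) != 0 for z in zs)

print(mismatches_at_fixed_t, paths_that_differ)
```

At the fixed time essentially no sampled $\omega$ produces a mismatch, while every sampled path
of $Y$ differs from the path of $X$ somewhere, which is the content of the two displayed
probabilities.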


A similar example is provided by the next exercise. 

Exercise 3.5. Suppose that $\Omega = [0, 1]$, $\mathcal{F}$ are the Borel sets of $[0, 1]$, and $P$ is the uniform
probability (i.e., Lebesgue measure) on $[0, 1]$, and assume that $\mathcal{F}$ is complete with respect
to $P$. For $t \in [0, 1]$ and $\omega \in [0, 1]$, define $X_t(\omega) = 0$ and $Y_t(\omega) = 1\{t = \omega\}$. Show that $X$
and $Y$ are versions of each other, but that they are not indistinguishable.

Thus we see that if two processes are indistinguishable, then they will necessarily have
a.s. indistinguishable sample paths. However, the same is not true if $Y$ is only a version of
$X$. In both of the previous two instances, $t \mapsto X_t$ has constant sample paths, but $t \mapsto Y_t$
has discontinuous sample paths.

The following exercise gives a partial converse. 

Exercise 3.6. Suppose that $Y$ is a version of $X$, and that both $X$ and $Y$ have right-continuous
sample paths. Show that $X$ and $Y$ are indistinguishable.

In order to define both version and indistinguishable, it was necessary that both $X$ and
$Y$ be defined on the same probability space. For our final definition of sameness, this is not
necessary.

Definition 3.7. Suppose that $X = \{X_\alpha, \alpha \in I\}$ is a stochastic process defined on a
probability space $(\Omega_1, \mathcal{F}_1, P_1)$, and that $Y = \{Y_\alpha, \alpha \in I\}$ is a stochastic process defined
on a probability space $(\Omega_2, \mathcal{F}_2, P_2)$. We say that $X$ and $Y$ have the same finite dimensional
distributions if for any integer $n \ge 1$; for any distinct indices $\alpha_1, \alpha_2, \ldots, \alpha_n$, each $\alpha_i \in I$; and
for every Borel set $A \in \mathcal{B}(\mathbb{R}^n)$, we have

$$P_1\big( (X_{\alpha_1}, \ldots, X_{\alpha_n}) \in A \big) = P_2\big( (Y_{\alpha_1}, \ldots, Y_{\alpha_n}) \in A \big).$$

Exercise 3.8. Suppose that $X$ and $Y$ are stochastic processes both defined on a common
probability space $(\Omega, \mathcal{F}, P)$. Show that if $Y$ is a version of $X$, then $X$ and $Y$ have the same
finite dimensional distributions.

In order to further study the sample path properties of a stochastic process, it will be 
convenient for it to have some joint measurability properties. 

Definition 3.9. Suppose that $(\Omega, \mathcal{F}, P)$ is a probability space. The stochastic process
$X = \{X_\alpha, \alpha \in I\}$, defined by $X : I \times \Omega \to \mathbb{R}$ where $X(\alpha, \omega) = X_\alpha(\omega)$, is said to be measurable if
for each $\omega \in \Omega$ the section $X(\cdot, \omega) : I \to \mathbb{R}$ is a measurable function (called the sample path
or trajectory of $X$ at $\omega$).

Remark. In other words, a stochastic process is measurable if the function

$$X : (I \times \Omega, \mathcal{B}(I) \otimes \mathcal{F}) \to (\mathbb{R}, \mathcal{B}(\mathbb{R}))$$

is measurable, where $\mathcal{B}(I) \otimes \mathcal{F} = \sigma(\mathcal{B}(I) \times \mathcal{F})$ is the product $\sigma$-algebra. Recall from [5,
Theorem 10.2] that if $X$ is (jointly) measurable, then each of its sections is necessarily
measurable.

For the benefit of the reader, we summarize the definition of measurable stochastic process
emphasizing the measurability of the sections. That is, if $X : I \times \Omega \to \mathbb{R}$ is a measurable
stochastic process with indexing set $I$ on the probability space $(\Omega, \mathcal{F}, P)$, then

• for each $\alpha \in I$ the section $X(\alpha, \cdot) : \Omega \to \mathbb{R}$ is a random variable, and

• for each $\omega \in \Omega$ the section $X(\cdot, \omega) : I \to \mathbb{R}$ is a measurable function (called the sample
path or trajectory of $X$ at $\omega$).

The first condition above is simply a restatement of the definition that a stochastic process 
is a collection of random variables. It is the second condition that gives the definition of 
measurable stochastic process its substance. 

Not all stochastic processes are measurable, and the following example shows that the 
measurability of a stochastic process cannot be taken for granted. 

Example 3.10. Suppose that for each $t \in [0, 1]$, the random variable $X_t$ is uniformly
distributed on $[-1, 1]$. Suppose further that the collection of random variables $\{X_t, t \in [0, 1]\}$
is independent. We now show that $X = \{X_t, t \in [0, 1]\}$ is not measurable. To the contrary,
suppose that $X$ is measurable and for $t \in [0, 1]$, define $Y_t$ by

$$Y_t = \int_0^t X_s \, ds.$$

It is not too difficult to show that each $Y_t$ is a random variable. Furthermore, the function
$t \mapsto Y_t(\omega)$ is continuous for all $\omega$. Now, if $P(Y_t \ne 0) = 0$ for all $t \in [0, 1]$, then $Y_t = 0$ for all
rationals a.s., and therefore $Y_t = 0$ for all $t$ a.s. by path continuity. We therefore conclude
that $X_t = 0$ for almost all $t$ a.s., and hence (by Fubini's theorem) $X_t = 0$ a.s. for almost all $t$,
which is impossible since each $X_t$ is uniformly distributed on $[-1, 1]$. Thus, there must exist
a $t \in [0, 1]$ such that $P(Y_t \ne 0) > 0$. This implies that $E(Y_t^2) > 0$. However,

$$E(Y_t^2) = E\left( \int_0^t \int_0^t X_r X_s \, dr \, ds \right) = \int_0^t \int_0^t E(X_r X_s) \, dr \, ds = 0,$$

since $E(X_r X_s) = E(X_r) E(X_s) = 0$ whenever $r \ne s$, which is a contradiction to the
assumption that $X$ is measurable.

We end this section with an important result which tells us when a stochastic process
has a measurable version, and a continuous version. Suppose that $X$ is a continuous time
stochastic process with indexing set $I = [0, \infty)$, and let $T > 0$. For every $\delta > 0$, the modulus
of continuity of the sample path $X(\omega)$ on $[0, T]$ is

$$m^T(X(\omega), \delta) = \sup\{ |X_t(\omega) - X_s(\omega)| : |s - t| \le \delta, \ 0 \le s, t \le T \}.$$
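
For a path that is only known on a finite time grid, the modulus of continuity can be
approximated by restricting the supremum to grid points. The Python sketch below does this
for one trajectory $t \mapsto z \sin t$ of the kind seen in Example 2.4. The grid size, the
amplitude $z = 0.7$, the horizon $T = 2\pi$, and the function name `modulus_of_continuity` are
assumptions of this example, and a grid version can only underestimate the true supremum.

```python
import math


def modulus_of_continuity(values, dt, delta):
    """Grid analogue of m^T: max |x_i - x_j| over grid points with |i - j| * dt <= delta."""
    max_lag = int(delta / dt + 1e-9)  # largest index gap within delta (guards against rounding)
    best = 0.0
    for lag in range(1, max_lag + 1):
        for i in range(len(values) - lag):
            best = max(best, abs(values[i + lag] - values[i]))
    return best


T, n = 2 * math.pi, 1000
dt = T / n
path = [0.7 * math.sin(k * dt) for k in range(n + 1)]  # trajectory t -> z * sin t with z = 0.7

for delta in (1.0, 0.1, 0.01):
    print(delta, modulus_of_continuity(path, dt, delta))
```

For this smooth path the printed values shrink roughly in proportion to `delta`, which is the
behaviour $m^T(X(\omega), \delta) \to 0$ that a continuous sample path must exhibit.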

Theorem 3.11. Suppose that $X$ is a continuous time stochastic process.

• If $X$ is stochastically continuous, then $X$ has a measurable version.

• $X$ has a continuous version if and only if (i) $X$ is stochastically continuous, and (ii)
$m^T(X(\omega), \delta) \to 0$ in probability as $\delta \downarrow 0$.

Further Reading. Further foundational results about stochastic processes in continuous 
time, including the proof of Theorem 3.11, may be found in [1, Chapter 6]. 